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Report  Title 

Real-Time  Distributed  Implementation  of  Interference  Alignment  with  Analog  Feedback 

ABSTRACT 

Interference  alignment  (IA)  is  known  to  achieve  the 
maximum  capacity  for  the  interference  channel.  IA  is  a  precoding 
technique  to  make  interfering  signals  be  aligned  at  receivers.  The 
resulting  sum  rate  is  linearly  scaled  with  the  number  of  users. 

The  performance  of  IA,  however,  depends  on  the  practical  issues 
such  as  the  performance  of  synchronization,  channel  estimation 
and  feedback.  In  this  paper,  a  prototype  is  implemented  for  the 
IA  system  with  three  users.  There  have  been  IA  prototypes  in 
recent  years,  but  the  previous  prototypes  have  not  considered  the 
distribution  of  the  nodes  in  IA  network.  The  nodes  are  physically 
distributed  in  our  prototype  not  sharing  their  time  and  frequency 
references  with  any  other,  thus  working  independently,  which 
enables  the  experimental  study  of  IA  under  the  most  practical 
setup.  For  the  distributed  system,  the  over-the-air  schemes  for 
time  and  frequency  synchronization  and  analog  feedback  are 
studied  and  implemented.  According  to  the  measurement  from 
our  prototype,  it  is  shown  that  IA  achieves  the  sum  rate  from  the 
previous  analysis  on  imperfect  channel  information.  In  addition, 
some  other  measurements  are  performed  considering  the  accuracy 
of  LA  solution,  synchronization  of  nodes,  and  CSI  feedback. 

For  the  accuracy  of  IA  solution,  the  performance  of  IA  versus 
the  number  of  iterations  for  an  iterative  IA  method  is  measured. 

For  synchronization  accuracy,  the  performance  with  different 
residual  frequency  offset  is  measured.  Finally,  for  the  CSI 
feedback  quality,  analog  feedback  and  scaler  quantization-based 
limited  feedback  is  first  compared,  and  then  the  performance 
with  Doppler  frequency  offset  is  also  measured. 
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Abstract — Interference  alignment  (IA)  is  known  to  achieve  the 
maximum  capacity  for  the  interference  channel.  IA  is  a  precoding 
technique  to  make  interfering  signals  be  aligned  at  receivers.  The 
resulting  sum  rate  is  linearly  scaled  with  the  number  of  users. 
The  performance  of  IA,  however,  depends  on  the  practical  issues 
such  as  the  performance  of  synchronization,  channel  estimation 
and  feedback.  In  this  paper,  a  prototype  is  implemented  for  the 
IA  system  with  three  users.  There  have  been  IA  prototypes  in 
recent  years,  but  the  previous  prototypes  have  not  considered  the 
distribution  of  the  nodes  in  IA  network.  The  nodes  are  physically 
distributed  in  our  prototype  not  sharing  their  time  and  frequency 
references  with  any  other,  thus  working  independently,  which 
enables  the  experimental  study  of  IA  under  the  most  practical 
setup.  For  the  distributed  system,  the  over-the-air  schemes  for 
time  and  frequency  synchronization  and  analog  feedback  are 
studied  and  implemented.  According  to  the  measurement  from 
our  prototype,  it  is  shown  that  IA  achieves  the  sum  rate  from  the 
previous  analysis  on  imperfect  channel  information.  In  addition, 
some  other  measurements  are  performed  considering  the  accu¬ 
racy  of  IA  solution,  synchronization  of  nodes,  and  CSI  feedback. 
For  the  accuracy  of  IA  solution,  the  performance  of  IA  versus 
the  number  of  iterations  for  an  iterative  IA  method  is  measured. 
For  synchronization  accuracy,  the  performance  with  different 
residual  frequency  offset  is  measured.  Finally,  for  the  CSI 
feedback  quality,  analog  feedback  and  scaler  quantization-based 
limited  feedback  is  first  compared,  and  then  the  performance 
with  Doppler  frequency  offset  is  also  measured. 

Index  Terms — Interference  alignment,  prototyping,  analog 
feedback. 

I.  Introduction 

Interference  management  plays  a  crux  role  in  current  wire¬ 
less  technologies  since  it  allows  the  efficient  use  of  limited 
time  and  frequency  resources.  Time  division  multiple  access 
(TDMA)  and  frequency  division  multiple  access  (FDMA)  have 
been  used  as  simple  interference  management  techniques  in 
the  past  wireless  systems.  Due  to  the  inherent  nature  of  ineffi¬ 
ciency  in  the  spectrum  usage,  however,  they  cannot  support  the 
evolving  wireless  systems  that  require  high  spectral  efficiency. 
This  motivates  the  study  for  effective  interference  management 
methods  using  power  control  or  precoding  techniques  that 
improve  the  spectrum  efficiency  significantly,  especially  when 
the  multiple  user  pairs  communicate  concurrently  using  shared 
time  and  frequency. 

The  concept  of  interference  alignment  (IA)  is  first  proposed 
for  multiple  input  and  multiple  output  (MIMO)  X  channel  in 
[1].  The  key  idea  of  IA  is  to  make  the  multiple  interference 
streams  be  aligned  at  a  receiver  so  that  the  dimensions  for  the 
subspace  occupied  by  the  interference  streams  are  minimized 
while  ensuring  the  independence  of  the  desired  streams.  In 
[1],  dirty  paper  coding  (DPC)  at  transmitters  and  successive 


interference  cancellation  (SIC)  at  receivers  are  combined  to 
achieve  optimal  multiplexing  gain.  As  a  result,  two  interfering 
streams  are  aligned  in  one  antenna  subspace  and  the  desired 
streams  are  received  without  interference  in  the  other  antenna 
subspaces.  In  [2],  it  is  shown  that  the  same  degree  of  freedom 
(DOF)  can  be  achieved  also  for  the  MIMO  X  channel  with 
zero  forcing  linear  technique  instead  of  non-linear  DPC  and 
SIC. 

IA  has  also  extended  to  K-user  interference  channel  in  [3]. 
The  separation  of  the  desired  streams  and  the  interference 
streams  is  done  by  linear  precoding  at  transmitters  and  decod¬ 
ing  at  receivers.  For  K-user  system,  the  DOF  can  be  linearly 
increased  with  the  number  of  users.  This  is  the  main  difference 
of  IA  from  the  other  orthogonal  multiple  access  schemes 
where  the  DOF  is  fixed  regardless  of  the  number  of  users.  The 
subspace  for  IA  can  be  time,  frequency  or  antenna  subspace. 
It  is  shown  in  [3],  however,  that  to  use  multiple  antennas  is  the 
most  efficient  way  to  form  the  required  dimensions  for  IA.  In 
[4],  [5],  the  feasibility  conditions  on  the  number  of  antennas 
for  I A  is  studied. 

The  core  task  of  IA  is  to  find  precoding  vectors.  Several 
schemes  exist  to  find  the  IA  precoding  vectors  :  the  closed 
form  method  in  [3]  and  the  iterative  methods  in  [6]  and  [7]. 
The  closed  form  solution  in  [3]  is  simple,  but  only  exists  for 
three  users,  while  the  iterative  methods  can  be  used  with  any 
number  of  users.  In  [6],  by  assuming  feedforward  and  feed¬ 
back  channels’  reciprocity  in  time  division  dumplex  (TDD) 
system,  the  iteration  is  done  between  the  transmitter  side  and 
the  receiver  side,  and  IA  is  used  for  both  the  feedforward  data 
and  the  feedback  data.  Two  optimization  problems  are  also 
suggested  in  [6]  for  the  iterative  methods  :  one  to  minimize 
total  interference  leakage,  and  the  other  to  maximize  signal 
to  interference  and  noise  ratio  (SINR).  The  key  in  [6]  is  that 
the  algorithms  do  not  need  channel  state  information  (CSI) 
feedback,  and  can  be  done  in  a  fully  distributed  manner  due 
to  the  channel  reciprocity.  In  [7],  the  iteration  is  performed 
at  either  the  transmitter  side  or  the  receiver  side,  not  between 
them.  The  base  algorithm  in  [7]  is  the  same  with  the  algorithm 
that  minimizes  total  leakage  in  [6],  but  there  is  no  assumption 
for  the  channel  reciprocity  and  the  algorithm  in  [7]  requires 
CSI  feedback. 

Although  IA  provides  a  promising  capacity  gain  in  the 
interference  network,  it  depends  strongly  on  the  assumptions 
such  as  perfect  synchronization  and  CSI  at  transmitter  (CSIT), 
which  are  infeasible  in  practice.  IA  considering  these  practi¬ 
cal  issues  are  studied  with  both  analytical  and  experimental 
approaches. 

For  the  analytical  approaches,  the  CSI  feedback  quality  is 


2 


studied  when  limited  feedback  [8]  or  analog  feedback  [9] 
is  used.  Furthermore,  IA’s  performance  in  imperfect  channel 
information  is  also  studied  in  [10].  In  [10],  it  is  assumed  that 
the  CSI  feedback  is  given  with  analog  feedback,  and  IA’s 
performance  with  imperfect  feedback  is  analyzed  with  the  CSI 
errors  caused  by  the  channel  estimation  error  in  feedforward 
channel,  the  channel  estimation  error  in  feedback  channel,  and 
the  decoding  error  of  CSI  data  in  feedback  channel. 

Some  work  has  studied  the  feasibility  with  the  experimental 
approaches.  In  [11],  [12],  [13]  and  [14],  IA  prototypes  are  built 
and  the  performance  of  IA  is  measured  and  studied  from  them. 
In  [11],  the  feasible  setups  for  IA  for  wireless  LAN  system  and 
the  imperfect  practical  issues  such  as  synchronization  errors 
are  studied.  In  [12],  the  wireless  channel  is  measured  from 
the  authors’  prototype,  and  the  sum  rate  is  calculated  from  it 
mainly  showing  that  the  gain  in  sum  rate  achieves  under  the 
measured  channel.  In  [13],  [14],  the  CSI  is  measured  at  the 
receivers,  precoding  vectors  are  obtained  with  the  CSI,  and  the 
sum  rate  is  calculated  from  over-the-air  precoded  transmission. 

The  main  contribution  of  this  paper  is  the  first  implemen¬ 
tation  and  measurement  of  real-time  distributed  IA  system. 
Though  there  have  been  the  experimental  studies  of  IA  in 
literature,  none  has  considered  the  practical  issues  when  the 
nodes  of  IA  are  physically  distributed,  and  consequently  the 
system  goes  with  overhead  and  accuracy  loss  in  sharing 
CSI.  Over-the-air  time  and  frequency  synchronization  and 
feedback  mechanisms  are  implemented  to  achieve  this  goal. 
From  the  prototype,  the  sum  rate  is  measured  showing  that 
IA  has  a  significant  gain  over  the  issues.  Also,  some  other 
measurements  are  performed  considering  the  accuracy  of  IA 
solution,  synchronization  and  CSI  feedback  mainly  showing 
the  relationship  between  overhead  and  performance. 

This  paper  is  organized  as  follows  :  Section  II  briefly 
describes  system  model  and  background.  Section  III  presents 
our  approaches  for  distributed  IA  system.  Section  IV  shows 
the  implementation  details  and  the  results.  Finally,  Section  V 
concludes  this  paper. 

Throughout  this  paper,  the  following  notations  are  used.  K 
is  the  number  of  users  in  IA  network,  Nt  and  is  the  number 
of  antennas  at  a  transmitter  and  Nr  is  the  number  of  antennas 
at  a  receiver.  Ns  is  the  number  of  the  desired  data  streams 
for  each  user’s  transmitter  and  receiver  pair.  supn  is  the  least 
upper  bound  of  set  n. 

II.  System  model  and  background 

Our  system  models  K-user  interference  channel.  For  the 
interference  channel,  there  is  one  by  one  mapping  of  a 
transmitter  and  a  receiver  as  the  example  with  Ns  =  1,  Nt  =  2 
and  Nr  =  2  in  Figure  1.  Only  the  data  streams  from  z-th 
transmitter  is  the  desired  streams  to  / - 1 h  receiver,  and  the 
streams  from  the  other  transmitters  are  the  interfering  streams. 
Each  of  the  transmitters  and  the  receivers  has  Nt  and  Nr 
antennas  respectively,  and  all  nodes  are  separated.  Assuming 
narrowband  block  fading  channel,  the  received  signal  yy  at 
receiver  node  fc  is  given  by, 

y k  =  T  'y  j  H^mFmdm  T  (1) 

m^k 
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Fig.  1.  Block  diagram  of  2  X  2  MIMO  3  user  system. 


where  Hk,m  is  Nr  x  Nt  channel  matrix  between  fc-th 
transmitter  and  m-th  receiver,  F ^  is  Nr  x  Ns  precoding  vector, 
d/,  is  Ns  x  1  data  stream  for  fc-th  user  pair,  and  ri/.  is  additive 
white  Gaussian  noise  (AWGN)  at  receiver  fc.  The  elements  in 
is  independent  identically  distributed  (i.i.d.). 

A.  Interference  alignment 

IA  is  a  linear  precoding  technique  that  maximizes  multi¬ 
plexing  gain  for  the  interference  channel.  By  precoding  at 
the  transmitters,  multiple  interfering  streams  are  aligned  in 
the  same  subspaces  at  the  receiver  side,  and  the  interfering 
streams’  subspaces  are  independent  of  the  desired  streams’ 
subspaces.  With  a  zero  forcing  decoder  at  the  receivers,  the 
received  signal  becomes. 


Vk  =  WfHA.,fcFfcdA;+^  WfHfc)mFmdm+Wfnfe.  (2) 

m^k 

Here,  W/,:  is  Ay  x  Ay  decoding  vector  at  receiver  fc.  The 
first  term  from  (2)  is  the  desired  signal  after  decoding  which 
has  the  rank  of  Ns  while  the  second  term  is  the  interference 
and  becomes  zero. 


Wf  Hfc,mFm  =  0  (3) 

Wf  Hfc,fcFfc  >  c  >  0  (4) 

The  sum  rate  Rsum  of  IA  system  is  defined  by  the  ratio  of 
the  desired  signal’s  power  and  the  sum  of  interference  leakage 
power  and  noise  power  as  follows. 
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(5) 


With  infinite  symbol  extension  in  time  or  frequency  domain, 
the  DOF  of  IA  is  K/2  as. 


(n  +  1)N  +  (K  —  1  )nN  K 

{n+l)N  +  nN  ~  T 


(6) 


where  n  is  any  positive  integer  and  N  =  {K  —\)(K —2)  —  1. 
With  multiple  antennas,  the  DOF  depends  on  the  antenna 
configuration  and 
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min  (Nt,  Nr) 


R 

R+l 


K. 


(7) 


Here,  R  = 


max(Nt,Nr ) 
min(Nt,Nr )  ' 


There  are  two  different  approaches  to  find  the  solution 
for  linear  IA  precoding  vectors  :  the  closed  form  solution 
[3]  and  the  iterative  methods  [6]  and  [7].  The  closed  form 
solution  is  available  only  for  three  user  system.  For  the  closed 
form  solution,  the  interfering  streams  from  transmitter  two  and 
transmitter  three  should  be  aligned  at  receiver  one  as  follows. 


is  required  for  the  iterative  methods  because  of  the  eigenvector 
calculation  at  every  iteration. 

For  the  decoding  vectors  at  the  receivers,  the  minimum 
mean  squared  error  (MMSE)  decoding  vector  is  presented  in 
[15],  The  decoding  vector  Wfc  is 

K 

Wfe  =  +  cr2I)_1Hfei/cFfc  (13) 

k— 1 

where  a2  is  noise  variance. 


H1j2F2  =  Hi>3F3  (8) 

Then,  F3  can  be  obtained  as 

F3  =  H^H1i2F2.  (9) 

Similarly  at  receiver  three,  interfering  streams  from  trans¬ 
mitter  one  and  transmitter  two  should  be  aligned,  and  Fi  is 

Fr  =  H3]H3,2F2.  (10) 

At  receiver  two,  interfering  streams  from  transmitter  one 
and  transmitter  three  should  be  aligned,  and  together  with  (9) 
and  (10),  F2  is  any  eigenvector  of  E,  where  E  is, 

E  =  H3jH3ilH2-jH2)3HrjHlj2.  (11) 

The  iterative  methods  find  IA  solution  by  numerical  ap¬ 

proaches.  The  iteration  happens  between  transmitter  side  and 
receiver  side  by  utilizing  the  channel  reciprocity  [6],  or  at 
either  transmitter  side  or  receiver  side  [7],  There  are  two 
optimization  problems  in  [6]  :  minimizing  interference  leakage 
and  maximizing  SINR.  The  optimization  problem  to  minimize 
interference  leakage  is  also  used  in  [7]  as  follows. 


I< 

max  V  V  ||Hfe,mFm  -  WfeW£Hfc!mFm||2 

WfcWfc=I«fc-Ni,  m^k 

(12) 

The  following  procedure  is  performed  to  solve  the  above 
problem. 

1)  Choose  the  set  of  Fm  randomly. 

2)  Choose  the  columns  of  W /,  to  be  the  Nj.  Ns  dominant 
eigenvectors  of  Hfc,mFmF^H *k  m  Mk. 

m^k 

3)  Choose  the  columns  of  Fm  to  be  the  Ns  least  dominant 
eigenvectors  of  ]C  H^m(Ijvfc  -  WfeW£)Hfc)m  Mm. 

m^k 

4)  Repeat  steps  2)  and  3)  until  convergence. 

Both  the  precoding  vector  set  {Fm}  and  the  decoding  vector 
set  { W;,. }  are  obtained  from  the  above  procedure.  The  conver¬ 
gence  is  guaranteed,  but  the  cost  function  (12)  is  not  a  convex 
function,  so  the  algorithm  can  fall  into  local  minimums.  Dif¬ 
ferent  from  the  closed  form  solution,  the  iterative  methods  can 
be  used  for  any  number  of  users  in  IA  system,  and  even  can  be 
used  for  distributed  IA  system  without  a  dedicated  feedback 
channel  if  the  system  uses  TDD  by  utilizing  the  reciprocity  of 
wireless  channel.  A  heavy  computational  complexity,  however. 


B.  Synchronization 

The  above  basic  analysis  of  IA  assumes  perfect  synchro¬ 
nization  and  perfect  CSI  measurement  and  feedback,  which 
are  not  feasible  in  real  world.  The  requirement  for  time  and 
frequency  synchronization  for  IA  is  studied  in  [11],  It  is 
known  from  [11]  that  the  phase  offsets  caused  by  time  and 
frequency  synchronization  errors  is  not  a  problem  for  IA. 
It  is  because  the  phase  error  in  synchronization  does  not 
affect  the  antenna  subspaces  where  IA  works.  Besides  this 
requirement  in  synchronization,  a  protocol  for  over-the-air 
nodes’  synchronization  is  required  for  the  distributed  system. 

There  are  two  types  of  synchronization  for  the  distributed 
IA  system  :  synchronization  among  transmitters  and  synchro¬ 
nization  between  the  transmitter  side  and  the  receiver  side. 
Once  the  former  is  achieved,  then  the  latter  becomes  a  point  to 
point  synchronization  that  is  well  studied  in  literature.  Thus, 
the  main  concern  on  synchronization  is  the  synchronization 
among  transmitters. 

For  our  interference  channel  model,  all  the  transmitters  use 
the  same  time  and  frequency  resources  to  send  their  own 
streams,  so  the  multiple  streams  are  overlapped  at  the  re¬ 
ceivers.  If  the  transmitters  are  asynchronous,  then  the  receivers 
have  to  estimate  and  compensate  different  time  and  frequency 
offsets  for  each  of  the  transmitted  streams,  and  it  is  an 
expensive  task  for  the  receivers  as  an  example  of  asynchronous 
multiuser  MIMO  system  in  [16].  Otherwise,  if  the  transmitters 
are  globally  synchronized,  the  receivers  need  to  estimate  and 
compensate  only  the  global  time  and  frequency  offsets  for 
all  the  overlapped  streams,  and  it  has  lower  synchronization 
overhead  than  the  case  of  the  asynchronous  transmitters. 

For  the  transmitter  synchronization,  master-slave  synchro¬ 
nization  protocols  have  been  proposed  in  [17],  [18]  and 
[19],  In  [17],  a  precise  symbol-level  time  synchronization 
protocol  is  introduced  for  wireless  LAN  systems,  and  in 
[18]  a  frequency  synchronization  method  based  on  the  time 
synchronization  protocol  in  [17]  is  also  presented.  Figure  2(a) 
shows  an  example  of  the  system  that  has  different  propagation 
delays  between  any  two  nodes.  The  protocol  in  [17]  assumes 
different  propagation  delays,  and  one  of  the  transmitters  works 
as  the  master  transmitter,  and  the  others  become  the  slave 
transmitters,  and  the  goal  is  to  align  the  transmitters  in  time 
and  frequency  so  that  the  signals  from  all  the  transmitters 
arrive  as  close  as  possible  in  time  and  frequency  at  the 
receivers. 

To  acquire  the  propagation  delays  and  the  other  internal 
delays  in  a  node  for  this  goal,  each  of  the  transmitters 


4 


Dj :  Propagation  delay  between  the  master  TX  and  the  slave  TX  i 
Tp,nm :  Propagation  delay  between  RX  n  and  TX  m 


(a)  System  with  different  propagatin  delays. 


TX  sends  probe 
RX  receives  probe 
RX  detects  probe 
RX  transmits  feedback 
TX  receives  feedback 
TX  detects  feedback 


(b)  Acquisition  of  one  propagation  delay. 


Fig.  2.  Over-the-air  master-slave  synchronization  protocol. 


periodically  sends  a  probe  to  the  others.  Figure  2(b)  shows  the 
possible  delay  elements  and  the  method  how  to  obtain  them. 
One  transmitter  sends  the  probe  (This  transmitter  is  denoted  as 
’TX’  in  Figure  2(b)),  and  then  each  of  the  other  transmitters 
and  the  receivers  (It  is  denoted  as  ’RX’  in  Figure  2(b))  senses 
the  probe,  measures  the  internal  processing  delay  to  detect  the 
probe  f/'/jj  and  the  turnaround  delay  from  receiving  mode  to 
transmitting  mode  ( Tt ),  informs  the  sum  of  those  delays  back 
to  the  transmitter  that  sends  the  probe.  Then,  the  transmitter 
receives  this  feedback  from  each  of  the  nodes,  and  measures 
the  air  propagation  delays  ( Tp )  with  its  internal  detection 
time  (Tp)  and  the  delay  information  (T^  +  Tt)  given  by  the 
feedback  node.  The  other  transmitters  perform  the  same  task 
in  turn. 

When  all  the  required  delay  information  is  obtained,  the 
system  is  ready  to  work  in  synchronous  manner.  The  master 
transmitter  sends  its  training  to  the  other  transmitters,  and  each 
of  the  other  transmitters  adjusts  the  time  with  the  previously 
measured  delay  information  to  send  its  own  packet.  After  some 
time  margin  to  this  adjustment,  all  transmitters  send  their 
synchronized  packets  to  the  receivers.  If  multiple  receivers 
exist  in  the  system,  however,  it  is  impossible  to  make  all 
receivers  receive  the  transmitters’  signals  at  the  exactly  same 
time.  Instead,  the  transmitters  adjust  their  packet  transmission 
time  so  that  the  sum  timing  error  at  all  the  receivers  to  be 
minimized,  e.g.  all  the  signals  are  to  be  received  within  the 
cyclic  prefix  of  orthogonal  frequency  division  multiplexing 
(OFDM)  system. 

For  frequency  synchronization,  the  slave  transmitters  mea¬ 
sure  the  frequency  offset  with  the  master’s  training,  recover  the 


offset  before  they  send  their  data  packet  [18].  Along  with  the 
frequency  offset,  the  phase  offset  between  the  master  and  the 
slaves  are  also  considered  in  [18].  The  phase  offset  caused  by 
oscillator  offset  is  inevitable,  and  it  may  affect  the  performance 
of  wireless  systems.  To  enable  the  slaves  to  measure  the  phase 
difference  from  the  master,  two  identical  trainings  are  sent 
from  the  master  :  one  at  the  first  transmission  of  the  synchro¬ 
nization  training  and  the  other  at  actual  data  transmission. 
The  slaves  detect  these  two  trainings  and  compare  them  to 
estimate  phase  offset  between  two.  If  the  residual  frequency 
offset  is  small,  it  can  be  assumed  that  there  is  only  the  constant 
phase  offset  between  the  master  and  a  slave.  The  slaves  also 
compensate  the  phase  offset  before  their  data  transmission. 

In  [17],  [18],  TDD  is  assumed,  but  there  is  also  a  scheme 
that  assumes  frequency  division  multiplexing  (FDD)  and  con¬ 
tinuous  transmission  [19].  In  [19],  the  master  continuously 
sends  its  OFDM-based  training  to  the  slaves  via  a  dedicated 
synchronization  channel,  and  the  phase  rotation  of  OFDM 
subcarriers  caused  by  time  and  frequency  offset  from  the 
master  is  measured  and  compensated  by  the  slaves  before  they 
send  their  own  data  stream. 


C.  CSI  feedback 

There  are  three  major  error  types  in  CSI  feedback  :  the  error 
caused  by  AWGN,  the  limited  feedback  error  and  the  delayed 
feedback  error.  The  limited  feedback  error  occurs  by  the 
quantization  of  the  estimated  CSI,  and  the  delayed  feedback 
error  is  by  channel’s  time-varying  property.  IA  with  limited 
feedback  is  first  analyzed  in  [8],  In  [8],  the  CSI  quantization 
error  with  Grassmannian  codebook  [20]  is  analyzed  assuming 
perfect  CSI  measurment  and  feedback.  According  to  its  result, 
the  DOF  of  K/ 2  for  single  input  and  single  output  IA  system 
is  achievable  if  the  number  of  total  feedback  bits  is  larger 
than  K(L  —  1  )logPf,  where  L  is  the  number  of  multipaths 
in  feedforward  channel  and  Pf  is  the  trasnmit  power  of  CSI 
feedback. 

The  analog  feedback  with  IA  is  presented  in  [9]  to  avoid 
the  quantization  error.  In  [9],  the  feedback  channel  estimation 
error  by  AWGN  at  feedback  training  and  the  feedback  data 
error  caused  by  AWGN  at  data  symbols  are  considered.  It 
is  shown  that  the  multiplexing  gain  of  IA  is  preserved  only 
with  a  constant  sum  rate  loss  when  the  transmitting  power 
of  feedback  linearly  relates  to  the  transmitting  power  of 
feedforward.  The  upper  bound  of  the  constant  sum  rate  loss 
A RSum  is  given  as  the  function  of  the  length  of  the  training 
for  feedback  channel  estimation  (rp),  the  length  of  the  CSI 
feedback  (tc),  and  the  ratio  of  Pf  and  the  transmit  power  P 
of  feedfoward  channel. 


p  i 

A Rsum{jp,Tc)  <  ^2  dilog2(l+  —c(tp,  tc)(K -  (14) 

n  ai 

Here,  c(tp,tc)  is  the  value  that  depends  on  tp  and  rc,  and 
if  either  tp  or  rc  gets  larger,  c(rp,rc)  gets  smaller  showing 
that  with  more  overhead  in  the  feedback  channel  estimation 
training  and  feedback  data,  the  sum  rate  loss  A Rsum  gets 
smaller.  To  maintain  the  multiplexing  gain,  Pf  needs  to 
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linearly  scale  with  P,  i.e.  Pf  =  aP.  Otherwise,  if  Pf  is  not 
linear  with  P  as  Pf  =  aP13,  there  exists  a  loss  in  multiplexing 
gain  according  to  3.  A  similar  extended  study  also  considering 
the  feedforward  channel  estimation  error  is  presented  in  [10]. 

III.  System  Realization 

We  are  targeting  2x2  MIMO  three  user  IA  system  as  Figure 
1.  Nt  =  Nr  =  2  for  all  user  pairs,  Ns  =  1,  and  I\  =  3. 
OFDM  is  used  for  both  feedforward  and  feedback  channels. 
With  this  setup,  two  antenna  subspaces  are  available  for  a  user 
pair,  and  two  interfering  streams  are  aligned  in  one  subspace 
at  each  receiver  while  the  desired  stream  is  received  in  the 
other  subspace. 


A.  IA  algorithms 

Both  the  closed  form  solution  and  the  iterative  method  in 
[7]  are  implemented  in  our  prototype.  For  the  closed  form 
solution,  the  formulation  in  the  previous  section  is  directly 
implemented.  For  the  iterative  method,  the  heavy  computation 
is  required  for  the  algorithm,  and  singular  value  decomposi¬ 
tion  (SVD)  of  2  x  2  matrix  is  the  main  processing  to  find 
eigenvectors.  To  reduce  the  complexity  of  SVD  and  to  be 
able  to  efficiently  extend  our  implementation  to  more  practical 
form,  e.g.  the  embedded  system  with  C++,  the  typical  two  step 
implementation  scheme  from  [21]  is  used.  The  first  stage  is 
to  decompose  the  target  matrix  into  a  real  bidiagonal  matrix. 
The  second  stage  is  to  find  the  SVD  of  this  bidiagonal  matrix. 

SVD  can  be  written  as  follows. 


B  =  USVfl  (15) 

Here,  B  is  the  vector  to  be  decomposed,  U  and  V  are 
unitary  matrices  and  S  is  a  diagonal  matrix  with  singular 
values.  In  this  paper,  we  provide  a  brief  summary  of  the 
algorithm,  and  the  details  are  in  [21].  In  the  first  stage,  we 
transform  the  target  matrix  as  follows. 


B  =  Ui 


/ 

0 


9 

h 


Vi  =  UiDVi. 


(16) 


Here,  the  elements  of  D,  /,  g  and  h  are  real  values. 

In  the  second  stage,  the  SVD  of  the  real  bidiagonal  matrix 
D  is  found  as 


D  =  U2EV^.  (17) 

With  (16)  and  (17),  the  final  eigenvectors  U  and  V  become 


U  =  UiU2  (18) 

V  =  V1V2.  (19) 


B.  Operation  Scheduling 

There  are  three  main  phases  in  IA  operation  scenario  : 
training,  feedback  and  data  phases.  Figure  3  summarizes  the 
signals  that  are  needed  to  be  transmitted  at  each  of  the  phases, 
and  also  shows  the  tasks  that  should  be  done  during  each  of 
the  phases  at  the  transmitters  and  the  receivers. 
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Fig.  3.  Operation  scenario  for  the  prototype. 


•  Training  Phase  :  This  is  mainly  for  CSI  measurement 
at  the  receivers.  The  synchronization  training  is  sent 
from  the  transmitters  followed  by  the  CSI  training.  The 
receivers  first  synchronize  to  the  transmitters  in  time 
and  frequency  with  the  received  synchronization  training, 
and  then  measure  CSI  which  should  be  given  back  to 
the  transmitters  at  the  feedback  phase.  Furthermore,  the 
synchronization  among  the  transmitters  should  be  done 
at  this  phase.  The  details  about  the  training  structure  and 
the  transmitter  synchronization  is  given  in  the  following 
section. 

•  Feedback  Phase  :  This  phase  is  for  CSI  feedback.  Each 
of  the  receivers  makes  a  packet  which  is  consisted  of  the 
synchronization  training,  the  CSI  training,  and  the  CSI 
feedback  data,  and  the  packet  is  sent  to  the  transmitters 
via  over-the-air  transmission.  The  packets  are  sent  in  time 
orthogonal  manner.  After  receiving  and  decoding  these 
packets,  the  transmitters  calculate  IA  precoding  vectors. 

•  Data  Phase  :  This  phase  is  for  the  precoded  training 
and  data  transmission.  The  synchronization  training,  the 
precoded  CSI  training,  and  the  precoded  data  are  sent  to 
the  receivers  in  order.  The  receivers  first  synchronize  to 
the  transmitters,  calculate  the  decoding  vectors  with  the 
precoded  CSI  training,  and  decode  the  precoded  data  with 
them.  Finally,  the  sum  rate  is  calculated  and  the  operation 
ends. 

C.  Over-the-air  synchronization 

The  master-slave  protocol  is  used  for  over-the-air  transmitter 
synchronization  in  our  prototype,  but  there  are  two  differences 
from  the  algorithms  in  [17]  and  [18],  First  of  all,  the  phase 
synchronization  in  [18]  is  not  required  in  our  prototype  since 
it  does  not  affect  IA  performance.  Also,  time  synchronization 
is  simplified  from  [17].  The  ah'  propagation  delays  between 
nodes  are  assumed  to  be  the  same.  This  assumption  is  possible 
since  the  sample  rate  of  our  system  is  1MHz,  which  means 
that  the  sample  duration  is  long  enough  to  avoid  channel  delay 
spread,  and  the  nodes  are  not  physically  apart  from  each  other 
by  long  distances  to  cause  a  symbol-level  difference  in  the 
air  propagation  delays.  With  this  assumption,  there  is  no  need 
to  measure  the  ah  propagation  delays.  Then,  the  processing 
delay  and  the  turn-around  delay  at  both  the  transmitters  and  the 
receivers  do  not  matter  only  if  the  following  two  requirements 
are  guaranteed  :  1)  a  symbol  should  be  sent  from  transmitting 
antennas  exactly  at  desired  time  at  the  transmitters,  and  2) 
the  exact  time  when  a  symbol  arrives  to  receiving  antennas  is 
known  at  the  receivers.  These  requirements  are  supported  by 
our  implementation  hardware  and  software. 
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Fig.  4.  Transmitter  synchronization  in  our  prototype. 


Figure  4  illustrates  our  protocol.  For  our  distributed  system, 
it  is  assumed  that  each  of  the  nodes  are  physically  separated, 
and  this  means  that  every  node  works  with  its  own  time  and 
frequency  references  not  knowing  the  others’  references.  At 
the  training  phase  which  is  the  starting  point  of  one  turn  of 
IA  procedure,  the  synchronization  is  done  in  the  following 
three  steps, 

•  Step  1.  The  master  transmitter  first  sends  its  training  to 
the  slave  transmitters  at  time  zero  with  its  local  timer. 
When  each  of  the  slaves  receives  and  detects  the  training, 

fnl 

it  knows  its  local  time  Tg  '  when  the  first  sample  of  the 
training  is  arrived  at  its  antennas.  Here,  n  is  the  slave 
index.  The  slaves  also  measure  the  frequency  offset  from 
the  master’s  frequency  with  this  training. 

•  Step  2.  Tc  is  the  waiting  time  for  synchronized  transmis¬ 
sion,  and  it  is  known  to  all  transmitters.  The  master  and 
the  slaves  wait  until  their  local  time  reaches  Tc  for  the 
master,  and  Tg  ‘  +  Tc  for  the  slaves. 

•  Step  3.  All  the  transmitters  send  their  own  training  to 
the  receivers  immediately  when  their  waiting  time  ends. 
Each  of  the  slaves  recovers  the  measured  frequency  offset 
from  its  own  training  before  it  is  sent.  The  training  and 
data  are  now  synchronized  in  time  and  frequency. 

The  same  operation  is  performed  at  the  data  phase  since 
the  time  between  the  training  phase  and  the  data  phase  may 
not  short  enough  for  the  slaves  to  maintain  their  previous 
synchronization.  At  the  data  phase,  the  starting  time  of  the 
data  phase,  TD,  is  only  known  to  the  master  and  it  sends 
its  training  again  to  the  slaves  at  that  time.  The  slaves  now 

fnl  fnl 

detect  it  at  time  Tg  +  TD  ,  and  all  transmitters  send  the 
synchronization  training  and  data  after  Tc  time.  Besides 
the  transmitter  synchronization,  the  receivers  also  need  to 
synchronize  to  the  transmitters.  Each  receiver  synchronizes 
independently  to  the  transmitters,  and  there  is  no  coorperative 
synchronization  among  the  receivers. 

An  auto-correlation  method  in  [22]  is  used  for  synchro¬ 
nization  at  all  nodes.  Since  the  synchronization  training  has 
repeated  patterns  in  time  domain,  the  auto-correlation  between 
two  repeated  patterns  F,/  becomes  as  follows  : 


I*  |2  IS  Vk(d  +  l)yl(d  +  l  +  G)\2 

Td  =  =  — - — ■  (20) 

Lf:\yk{d  +  l  +  GY\ 

i= o 

Here,  d  is  the  current  received  sample,  Lrep  is  the  correla¬ 
tion  length  and  G  is  the  length  of  a  pattern.  The  normalized 
auto-correlation  is  the  ratio  of  the  auto-correlation  <!»,/  and 
the  power  Pd .  The  exact  time  synchronization  point  ds  is  the 
d  that  maximizes  I’,/.  The  frequency  offset  is  obtained  from 
the  arctangent  value  of  r,js. 

D.  CSI  Feedback 

The  main  feedback  scheme  in  this  paper  is  analog  feed¬ 
back.  Under  the  assumption  of  flat  fading  channel,  receiver  i 
measures 

H  ij,  j  =  1,2,3.  (21) 

Since  each  Hy  is  2  x  2  matrix,  there  are  K  x  Nt  x  Nr 
complex  values  for  the  measured  CSI  at  each  receiver.  These 
values  are  mapped  to  Nsc  subcarriers  by  Nsc  x  (K  x  Ntx  Nr) 
mapping  matrix,  and  the  Nsc  subcarriers  form  an  OFDM 
symbol.  As  a  result,  each  receiver  has  an  OFDM  symbol 
for  analog  feedback.  The  transmitters  that  receive  the  OFDM 
symbols  apply  the  pseudo  inverse  of  the  mapping  matrix  to 
the  received  OFDM  symbols,  and  find  the  CSI  values.  The 
transmit  power  which  decides  the  difference  in  SNR  between 
feedforward  and  feedback  channel  as  in  (14)  is  a  design 
parameter. 

For  limited  feedback  as  a  competitor,  the  explicit  feedback 
beamforming  method  from  the  beamforming  specification  of 
802.1  In  wireless  LAN  is  used  [23].  With  the  quantization 
method,  Ngain  bits  are  used  for  the  amplitude  of  CSI,  and  Nq 
bits  are  used  to  quantize  each  of  real  and  imaginary  values  of 
the  measured  CSI.  The  integer  amplitude  index  Mi  in  dB  for 
receiver  i’s  CSI  feedback  is 

Mi  =  min{ -  1,  |20Zogi0(— ^)J}.  (22) 

nii 

Gref  is  the  maximum  gain  reference,  and  rrii  is  the  maxi¬ 
mum  absolute  value  among  all  the  real  and  imaginary  values 
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of  H ij,j  =  1,2,3.  [x\  is  the  largest  integer  smaller  than  or 
equal  to  x.  Mi  index  is  given  to  the  transmitters  via  feedback 
channel. 

The  real  and  imaginary  values  of  H)?  are  quantized  and 
normalized  to  H qscaledp^  which  is  consisted  of  Nq  bits  with 
two’s  compliment  fixed  point  number  representation  as 

R< H?-)  =  round  ^  1))  (23) 

Im{ H?j)  =  round  ( ~  1))  •  (24) 

Here,  M,-  is  the  normalization  factor  given  by 
Gref  1 10Mi/20.  For  each  receiver,  the  total  number  of  bits  to  be 
transmitted  as  CSI  feedback  is  Ngain  +  2  x  Nq  x  K  x  Nt  x  Nr. 
These  bits  are  mapped  to  QPSK  modulation  and  OFDM 
subcarriers. 

The  transmitters  that  receive  the  CSI  feedback  first  decode 
Mi  and  Hreceived(ij)  by  QPSK  and  OFDM  decoding,  then 
they  find  the  CSI  by 


Re(Hr 
Im(  Hre 


ve  d(ij)) 
e  d(ij))  ~ 


M  H?-) 
10MV20 

Mh?.) 

10M,/20  • 


(25) 

(26) 


N gain  =  3  and  Nq  =  4,  5, 6  and  8  for  our  implementation. 
Gref  is  chosen  to  minimize  the  mean  squared  error  (MSE)  by 
quatization. 

The  quality  of  CSI  feedback  is  also  affected  by  Doppler 
frequency.  To  measure  the  performance  of  our  system  from 
Doppler  effect,  the  Doppler  effect  model  by  memoryless 
Markov  process  is  used  as  an  example  in  [24].  Assuming  the 
channel  is  Rayleigh  block  fading  channel,  the  channel’s  time 
varying  property  by  Doppler  effect  is  modeled  with  Markov 
process  as. 
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Fig.  5.  Packet  structures. 
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MHz  in  our  prototype  because  of  hardware  constraints.  Both 
feedforward  and  feedback  channel  use  the  same  frequency,  and 
the  system  is  a  TDD  system.  OFDM  is  used  for  the  training 
and  data,  and  only  the  synchronization  training  is  generated 
at  time  domain.  The  OFDM  parameters  are  summarized  in 
Table  I. 


H ij[n\  =  p&ij[n  -  1]  +  e[n}.  (27) 

Here,  p  is  the  channel  correlation  coefficient,  and  e[n]  is 
the  channel  error  that  has  complex  normal  distribution.  The 
constraint  for  p  and  e[n\  is  p2  +  e[n]2  =  1.  If  fd  is  Doppler 
frequency  and  Ts  is  the  symbol  duration,  p  is  given  as  p 
Jo(27t/£;Ts),  where  Jq(x)  is  the  0-th  order  Bessel  function  of 
the  first  kind.  It  can  be  seen  that  if  p  =  0,  the  channels  at  n 
and  n  —  1  are  completely  independent.  On  the  other  hand,  if 
p  =  1,  the  channel  is  perfectly  known  to  the  transmitters. 

IV.  Experiments  and  Results 


table  i 

OFDM  PARAMETERS  FOR  THE  PROTOTYPE. 


FFT  length 

128 

Cyclic  Prefix  Length 

32 

Number  of  null  subcarriers 

23 

Number  of  data  subcarriers 

105 

The  system  is  designed  to  operate  at  2.4  GHz,  the  bandwidth 
would  ideally  be  large  as  20  MHz,  but  it  is  lower  as  1 


A.  Experiment  setup 

As  described  in  the  previous  section,  our  system  works  in 
three  phases  :  the  training,  the  feedback  and  the  data  phases. 
For  each  of  the  phases,  the  packet  structure  is  shown  in 
Figure  5.  In  Figure  5,  the  packet  structures  for  the  training 
phase  (Figure  5(a)),  the  feedback  phase  (Figure  5(b)),  and  the 
data  phase  (Figure  5(c))  are  given.  There  are  three  types  of 
trainings  in  our  system: 

•  Synchronization  training  :  The  synchronization  training 
is  for  the  nodes’  synchronization.  It  has  two  parts  :  short 
training  for  coarse  time  synchronization  and  long  training 
for  fine  time  and  frequency  synchronization.  A  length  17 
Zadoff-Chu  sequence  is  repeated  five  times  as  the  short 
training,  and  a  length  29  Zaduff-Chu  sequence  is  repeated 
three  times  as  the  long  training.  The  training  has  repeated 
patterns  in  time  domain,  and  the  synchronization  is  done 
by  the  auto-correlation  method  in  (20). 

•  CSI  training  :  The  CSI  training  is  for  CSI  measurement 
to  be  sent  back  to  the  transmitters  for  IA  precoding 
vector  calculation.  One  CSI  training  is  an  OFDM  symbol, 
and  it  is  sent  from  each  of  the  transmitting  antennas 


Fig.  6.  Software  and  hardware  configuration  of  the  prototype. 

in  time  orthogonal  manner  as  in  Figure  5(a).  As  such, 
six  OFDM  symbols  are  transmitted  for  our  three  user 
2x2  MIMO  system.  The  training  does  not  experience 
precoding  nor  decoding.  Thus,  the  receivers  can  find  the 
pure  condition  of  current  wireless  channel.  With  this 
training,  each  receiver  measures  2x6  CSI  matrix. 

•  Precoded  training  :  The  precoded  training  is  sent  at 
the  data  phase,  and  it  experiences  IA  precoding  at  the 
transmitters,  where  each  user  in  our  three  user  system 
has  one  data  stream  that  is  mapped  to  two  antennas 
via  the  precoding  vector.  It  is  used  for  the  decoding 
vector  calculation.  Also,  since  this  training  is  precoded 
in  the  same  way  with  the  precoded  data,  it  is  used  to 
equalize  the  data  symbols.  The  training  is  also  an  OFDM 
symbol,  and  it  is  sent  from  two  transmitting  antennas  of 
a  transmitter  at  a  time.  Each  transmitters  sends  it  in  time 
orthogonal  manner  as  in  Figure  5(c). 

B.  Software  and  hardware 

Figure  6  shows  the  hardware  and  software  configuration 
of  our  prototype.  The  system  uses  two  computers  that  are 
equipped  with  dual-core  Intel  Xeon  2.67  GHz  processors 
and  12GB  of  memory.  One  computer  is  used  to  control 
the  trasnmitters,  and  the  other  is  for  the  receivers.  National 
Instruments  (NI)  USRP-2921  [25]  is  used  for  analog  to  digital 
convertor  (ADC)  /  digital  to  analog  convertor  (DAC)  and  radio 
frequency  (RF)  front-end.  One  USRP  works  as  one  antenna,  so 
two  USRPs  are  used  for  a  node,  and  accordingly  one  computer 
controls  six  USRPs.  A  USRP  can  switch  between  transmitting 
mode  and  receiving  mode,  e.g.  the  USRPs  for  the  transmitters 
work  in  transmitting  mode  for  the  training  and  data  phases,  but 
in  receiving  mode  for  the  feedback  phase.  The  computer  and 
the  USRPs  are  connected  via  TCP/IP  connections.  National 
Instruments  (NI)  LabVIEW  which  is  a  model-based  digital 
signal  processing  implementation  software  [26]  is  used  for  our 
software  defined  radio  (SDR)  implementation.  The  control  of 
USRPs  is  also  done  by  LabVIEW’s  USRP  driver.  Though  only 
one  computer  is  used  for  all  the  transmitters,  and  similarly 
only  one  computer  is  used  for  all  the  receivers,  the  signal 
processing  for  each  node  is  completely  separated  from  each 
other  not  to  violate  the  interference  channel  model,  i.e.  there 


Fig.  7.  Hardware  configuration  of  one  node. 

are  three  parallel  independent  processing  chains  in  LabVIEW 
on  a  computer. 

Since  our  system  is  distributed  system,  each  node  should 
work  with  its  own  oscillator.  The  GPS  disciplined  oscillator 
(GPSDO)  module  that  can  be  attached  to  USRP  is  used  as 
the  oscillator  for  a  node  [27].  The  GPSDO  is  originally  for 
the  global  synchronization  to  the  GPS  signal  which  is  only 
available  at  outdoors.  There  is,  however,  no  assumption  of 
outdoor  operation  for  IA  system  in  general,  so  the  GPSDOs 
are  not  used  to  synchronize  to  the  global  GPS  signal.  Instead, 
a  GPSDO  is  used  to  provide  time  and  frequency  references 
only  to  a  single  node  in  our  prototype.  Though  the  GPSDOs 
are  used,  our  distributed  synchronization  algorithm  is  designed 
assuming  that  each  of  the  GPSDOs  generates  the  independent 
time  reference  (pulse  per  second  (PPS))  and  frequency  refer¬ 
ence  (10MHz)  without  being  synchronized  to  the  GPS  signal 
since  our  prototype  is  tested  in  indoor  environment  where  the 
global  GPS  signal  is  unavailable.  The  PPS  and  10MHz  are 
the  required  time  and  frequency  references  for  the  USRPs 
respectively.  Six  GPSDOs  are  used  because  there  are  six  nodes 
in  the  system,  and  the  time  and  frequency  reference  signals 
from  a  GPSDO  are  only  shared  between  two  USRPs  which 
form  a  node’s  two  antennas.  Figure  7  shows  the  hardware 
configuration  of  a  node  in  the  prototype.  The  MIMO  cable 
between  the  USRPs  in  Figure  7  is  only  used  for  the  ethernet 
connection  of  two  USRPs  to  a  computer. 

Figure  8  shows  the  schematic  of  distributed  test  environ¬ 
ment.  Both  the  transmitters  and  the  receivers  are  physically 
separated  to  verify  our  system.  To  make  the  SNRs  of  the 
air  links  be  similar  as  possible,  the  distances  between  any 
transmitter  and  any  receiver  are  made  to  be  the  same  as 
possible.  Also,  the  AWGN  generator  which  is  used  to  scale 
the  overall  SNR  of  the  system  is  located  at  the  center  of  the 
nodes. 

C.  Results 

In  this  paper,  the  sum  rate  of  distributed  IA  with  analog 
feedback  is  first  presented.  Then,  more  detailed  measurements 
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Fig.  8.  Test  environment  of  distributed  nodes. 
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Fig.  9.  An  example  constellation  of  analog  feedback. 


on  the  relationship  between  the  accuracy  of  IA  solution,  resid¬ 
ual  frequency  offset,  and  CSI  feedback  are  also  given.  For  all 
the  results  in  this  paper,  the  distributed  nodes’  synchronization 
protocol  is  applied. 

To  scale  SNR  of  the  system,  Agilent  E4438C  vector  signal 
generator  is  used  [28].  A  white  Gaussian  complex  random  data 
with  5,000  samples  is  generated  from  Matlab  and  uploaded 
to  the  signal  generator,  and  the  data  is  emitted  repeatedly  at 
2.4GHz.  By  controlling  RF  gain  from  the  signal  generator, 
SNR  can  be  changed,  and  it  is  measured  at  the  receivers.  SNR 
measurement  is  done  by  measuring  the  noise  power  from  CSI 
training  at  frequency  domain.  After  the  channel  estimation  is 
done  at  frequency  domain  by  FFT,  this  pre -channel  estimation 
result  is  transformed  into  time  domain  samples  by  IFFT. 
Now  the  samples  are  channel’s  impulse  responses,  and  the 
first  few  samples  show  the  channel  power  and  the  following 
samples  show  noise  power.  By  nulling  the  noise  samples  and 
performing  FFT  again,  the  noise  free  post-channel  estimation 
result  is  obtained.  The  SNR  is  defined  by  the  ratio  of  the 
signal  power  from  noise  free  post-channel  estimation  result 
and  the  noise  power  from  the  difference  between  pre  and  post 
channel  estimation  results.  The  same  transmit  power  is  used 
both  for  feedforward  and  feedback  channel,  so  the  SNRs  for 
both  channels  are  the  same. 
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(b)  Comparison  of  measured  sum  rate  to  simulated  sum  rate. 
Fig.  10.  I A  performance  with  analog  feedback 


Figure  9  shows  an  example  of  a  packet’s  received  CSI  at 
the  transmitters  with  analog  feedback  when  the  SNR  is  30dB. 
Figure  9  has  12  constellation  groups,  and  each  group  is  an 
element  of  2  x  6  {Hji,  H*2,  H.^}  channel  matrix  from  z-th 
receiver.  Four  different  points  in  a  group  show  the  received 
CSI  at  transmitter  1,  2,  3,  and  the  perfect  CSI. 

The  sum  rate  of  our  system  is  given  in  Figure  10.  Each 
dot  in  the  first  subfigure  is  the  measured  instantaneous  sum 
rate  within  SNR  range  from  15dB  to  35dB  and  the  solid  line 
shows  their  linear  fitting,  and  the  second  subfigure  shows  the 
comparison  of  the  linear  fitting  to  the  simulation  results.  The 
simulation  is  performed  under  different  setups  :  1)  perfect  CSI 
estimation  (CE)  and  feedback,  2)  only  with  feedforward  CE 
error,  3)  with  feedforward  CE  and  feedback  data  errors,  4) 
with  the  errors  in  3)  and  feedback  CE  error,  and  5)  with  all 
the  error  sources  :  feedforward  CE  error,  feedback  data  error, 
and  feedback  CE  error,  and  feedforward  precoded  CE  error. 
It  is  observed  from  Figure  10  that  the  multiplexing  gain  is 
maintained  by  analog  feedback  only  with  a  constant  sum  rate 
degradation  if  the  feedback  transmit  power  is  linear  to  the 
feedforward  transmit  power  for  CSI  measurement  as  analyzed 
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Fig.  1 1 .  Sum  rate  with  iterative  IA  method. 


Fig.  12.  I A  performance  with  the  residual  frequency  synchronization  error 

in  [9].  This  result  is  possible  since  the  SNRs  for  feedforward 
channel  and  feedback  channel  are  the  same  in  our  experiments. 

To  study  the  performance  with  imperfect  IA  solution.  Fig¬ 
ure  1 1  shows  the  sum  rate  versus  the  number  of  iterations  for 
the  iterative  I A  method.  The  iterative  method  from  [7]  is  used, 
and  the  number  of  iterations  is  an  important  design  parameter 
for  the  method.  The  mean  sum  rate  at  SNR  =  30dB  and  SNR  = 
20dB  are  measured  and  compared  to  those  from  the  simulation 
with  the  perfect  CSI  feedback.  Though  the  sum  rate  from  the 
prototype  is  smaller  than  that  from  the  simulation  because 
it  is  limited  by  CSI  estimation  and  feedback,  the  number  of 
iterations  to  achieve  the  sum  rate  is  not  reduced.  This  shows 
that  the  leakage  from  the  imperfect  IA  solution  is  independent 
to  the  other  error  sources  regardless  of  SNR  of  the  system. 

Figure  12  shows  the  IA  performance  with  synchronization 
error.  The  time  synchronization  error  of  IA  with  OFDM  does 
not  affect  the  IA  performance  only  if  the  timing  synchroniza¬ 
tion  is  done  only  within  cyclic  prefix,  and  this  requirement 
is  not  new  to  IA.  Frequency  synchronization,  however,  can 
not  be  perfect  in  the  real  systems,  and  there  always  remains 
the  residual  frequency  offset  after  synchronization.  It  causes 


(a)  Measured  sum  rate  with  limited  feedback  and  analog  feedback 


(b)  Sum  rate  vs.  MSE  of  limited  feedback. 

Fig.  13.  I A  performance  with  limited  feedback 

inter  channel  interference  (ICI)  which  can  not  be  recovered 
by  IA,  so  it  is  added  as  an  extra  error  source  to  the  system. 
Furthermore,  the  ICI  by  the  residual  frequency  offset  affects 
all  the  CSI  measurement  and  feedback  steps  which  are  listed  in 
Figure  10.  In  Figure  12,  the  sum  rate  is  plotted  with  different 
%  frequency  offsets  with  respect  to  the  subcarrier  spacing. 
The  same  frequency  offset  is  added  to  the  air  links  between 
the  master  transmitter  and  the  slave  transmitters,  and  to  the 
connection  between  the  transmitters  and  the  receivers.  The 
SNR  is  30dB. 

The  analog  feedback  is  also  compared  to  limited  feedback. 
For  limited  feedback,  the  uncompressed  explicit  scalar  quan¬ 
tization  in  802.1  In  specification  [23]  is  used  as  described  in 
the  previous  section.  Nq  =  4,  5,  6  and  8  are  applied  for  the 
evaluation.  It  is  observed  from  Figure  13(a)  that  the  slope  of 
sum  rate  increases  as  more  bits  are  allocated,  i.e.  as  feedback 
overhead  is  increased,  which  means  more  bits  are  needed 
as  feedback  as  SNR  gets  high  to  get  the  multiplexing  gain 
of  IA.  Figure  13(b)  shows  the  sum  rate  against  MSE  of 
limited  feedback.  The  fitting  line  is  the  linear  connection  of 
the  means  of  the  sum  rates  with  different  Nq.  The  results 
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Fig.  14.  I A  performance  with  Doppler  effect 

in  Figure  13  gives  us  some  interesting  observations.  First 
of  all,  the  multiplexing  gain  of  quantization-based  feedback 
increases  with  increasing  number  of  Nq,  which  corresponds 
to  the  previous  analysis  in  [8].  Furthermore,  compared  to  the 
quantization-based  feedback,  analog  feedback  has  higher  sum 
rate  performance  at  high  SNR  region  than  the  quantization- 
based  feedback.  This  is  because  the  MSE  of  analog  feedback 
is  reduced  with  increasing  feedback  SNR,  but  the  MSE  of 
quantization-based  feedback  is  fixed  regardless  of  SNR  if  it 
is  assumed  that  there  is  no  bit  error  for  quantization-based 
feedback. 

The  delay  in  CSI  feedback  also  causes  multiplexing  gain 
degradation  of  IA.  Figure  14  presents  the  sum  rate  measure¬ 
ment  result  versus  the  speed  of  nodes  at  30dB  SNR.  The 
measured  CSI  is  modified  with  the  p  and  e  from  Markov 
process  when  the  speed  of  node  is  given.  The  modified  CSI  is 
transmitted  to  the  transmitter  via  analog  feedback.  The  sample 
duration  Ts  =  1ms  is  assumed.  From  Figure  14,  it  is  obvious 
that  IA  is  very  sensitive  to  delayed  feedback  loosing  most  of 
sum  rate  gain  at  lower  speed  than  5  km/h. 

V.  Conclusion  and  Future  Work 

The  prototype  of  the  three  user  IA  system  with  2x2  antenna 
configuration  for  each  user  is  implemented.  With  the  proto¬ 
type,  the  performance  of  IA  with  the  practical  issues  is  studied, 
and  the  interesting  measurement  results  about  feedback  quality 
are  first  presented.  Our  IA  system  targets  interference  channel, 
so  the  nodes  are  assumed  to  be  physically  separated  and  to 
work  independently  without  coorperation  following  the  basic 
assumption  of  interference  channel.  It  is  achieved  by  develop¬ 
ing  over-the-air  time  and  frequency  synchronization  protocol 
and  feedback  methods.  The  synchronization  requirement  of  IA 
is  studied,  and  the  over-the-air  master/slave  synchronization 
method  is  developed.  Analog  feedback  is  implemented  as 
the  over-the-air  feedback  method,  and  the  performance  of 
IA  with  analog  feedback  in  real  world  is  verified  from  our 
prototype.  The  performance  of  IA  with  imperfect  IA  solution, 
synchronization  and  CSI  feedback  is  also  studied  providing 
us  the  chances  for  design  space  exploration  of  IA’s  design 


parameters.  Finally,  the  sum  rate  degradation  according  to 
mobility  is  measured  showing  a  critical  limitation  of  IA. 

To  move  forward  to  enable  IA  in  the  practical  applications, 
the  achievable  performance  of  IA  with  the  hardware  con¬ 
straints  in  the  real  world  such  as  the  limited  and  heterogeneous 
computing  capability  of  IA  network  as  well  as  the  limited 
energy  supply  of  portable  devices  needs  to  be  studied,  and  it 
remains  as  our  future  work. 
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